Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 582 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 63.8 KiB |
| Average record size in memory | 112.2 B |
Variable types
| Categorical | 4 |
|---|---|
| Numeric | 10 |
Alerts
category has constant value "성인종합영양제" (adult multivitamin) | Constant |
TV is highly correlated with cable and 2 other fields | High correlation |
cable is highly correlated with TV and 2 other fields | High correlation |
jong is highly correlated with TV and 2 other fields | High correlation |
sum is highly correlated with TV and 2 other fields | High correlation |
TV is highly correlated with cable and 2 other fields | High correlation |
cable is highly correlated with TV and 3 other fields | High correlation |
jong is highly correlated with TV and 3 other fields | High correlation |
UCC is highly correlated with cable and 2 other fields | High correlation |
sum is highly correlated with TV and 3 other fields | High correlation |
TV is highly correlated with cable and 2 other fields | High correlation |
cable is highly correlated with TV and 2 other fields | High correlation |
jong is highly correlated with TV and 2 other fields | High correlation |
sum is highly correlated with TV and 2 other fields | High correlation |
item is highly correlated with category | High correlation |
product is highly correlated with advertiser and 1 other fields | High correlation |
advertiser is highly correlated with product and 1 other fields | High correlation |
category is highly correlated with item and 2 other fields | High correlation |
advertiser is highly correlated with product and 1 other fields | High correlation |
product is highly correlated with advertiser and 2 other fields | High correlation |
date is highly correlated with product | High correlation |
item is highly correlated with cable and 3 other fields | High correlation |
TV is highly correlated with cable and 4 other fields | High correlation |
radio is highly correlated with advertiser and 1 other fields | High correlation |
newspaper is highly correlated with magazine | High correlation |
magazine is highly correlated with newspaper and 1 other fields | High correlation |
cable is highly correlated with item and 4 other fields | High correlation |
jong is highly correlated with item and 4 other fields | High correlation |
UCC is highly correlated with item and 4 other fields | High correlation |
banner is highly correlated with TV and 2 other fields | High correlation |
sum is highly correlated with item and 5 other fields | High correlation |
banner is highly skewed (γ1 = 23.71926611) | Skewed |
TV has 342 (58.8%) zeros | Zeros |
radio has 540 (92.8%) zeros | Zeros |
newspaper has 387 (66.5%) zeros | Zeros |
magazine has 432 (74.2%) zeros | Zeros |
cable has 252 (43.3%) zeros | Zeros |
jong has 282 (48.5%) zeros | Zeros |
UCC has 354 (60.8%) zeros | Zeros |
banner has 569 (97.8%) zeros | Zeros |
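The zero shares in the alerts above follow from the zero counts and the 582 observations reported under Dataset statistics. A minimal sketch of that arithmetic, assuming one-decimal rounding; the variable names are illustrative and not part of the report:

```python
# Zero counts per column, copied from the alerts; 582 is the number of observations.
n_rows = 582
zero_counts = {
    "TV": 342, "radio": 540, "newspaper": 387, "magazine": 432,
    "cable": 252, "jong": 282, "UCC": 354, "banner": 569,
}

# Share of zeros in percent, rounded to one decimal place (assumed rounding rule).
zero_share = {name: round(100 * count / n_rows, 1) for name, count in zero_counts.items()}

for name, share in zero_share.items():
    print(f"{name}: {share}%")
```

Every media column except cable and jong is zero in more than half of the rows, which is worth keeping in mind when reading the correlation alerts.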
Reproduction
| Analysis started | 2022-05-02 04:37:23.853301 |
|---|---|
| Analysis finished | 2022-05-02 04:37:34.089491 |
| Duration | 10.24 seconds |
| Software version | pandas-profiling v3.1.0 |
category
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.7 KiB |
| 성인종합영양제 |
|---|
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 성인종합영양제 |
|---|---|
| 2nd row | 성인종합영양제 |
| 3rd row | 성인종합영양제 |
| 4th row | 성인종합영양제 |
| 5th row | 성인종합영양제 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 성인종합영양제 | 582 | 100.0% |
[Figure: histogram of value lengths]
[Figure: pie chart of value frequencies]
advertiser
| Distinct | 7 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.7 KiB |
| GC녹십자 | |
|---|---|
| 일동제약 | |
| 글락소스미스클라인컨슈머헬스케어코리아 | |
| 한국화이자제약(주) | |
| 구주제약 | |
| Other values (2) |
Length
| Max length | 19 |
|---|---|
| Median length | 5 |
| Mean length | 7.378006873 |
| Min length | 4 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 구주제약 |
|---|---|
| 2nd row | 구주제약 |
| 3rd row | 구주제약 |
| 4th row | 구주제약 |
| 5th row | 구주제약 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| GC녹십자 | 208 | 35.7% |
| 일동제약 | 168 | 28.9% |
| 글락소스미스클라인컨슈머헬스케어코리아 | 94 | 16.2% |
| 한국화이자제약(주) | 58 | 10.0% |
| 구주제약 | 21 | 3.6% |
| 삼진제약 | 21 | 3.6% |
| 유유제약 | 12 | 2.1% |
[Figure: histogram of value lengths]
[Figure: pie chart of value frequencies]
product
| Distinct | 20 |
|---|---|
| Distinct (%) | 3.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.7 KiB |
| 아로나민골드 | |
|---|---|
| GC녹십자비맥스메타정 | |
| 글락소스미스클라인센트룸 | |
| 아로나민씨플러스 | |
| GC녹십자비맥스골드 | |
| Other values (15) |
Length
| Max length | 20 |
|---|---|
| Median length | 10 |
| Mean length | 9.651202749 |
| Min length | 4 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | 구주알코덱스 |
|---|---|
| 2nd row | 구주알코덱스 |
| 3rd row | 구주알코덱스 |
| 4th row | 구주알코덱스 |
| 5th row | 구주알코덱스 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 아로나민골드 | 100 | 17.2% |
| GC녹십자비맥스메타정 | 75 | 12.9% |
| 글락소스미스클라인센트룸 | 69 | 11.9% |
| 아로나민씨플러스 | 62 | 10.7% |
| GC녹십자비맥스골드 | 60 | 10.3% |
| 한국화이자센트룸포맨&포우먼 | 39 | 6.7% |
| GC녹십자비맥스액티브정 | 36 | 6.2% |
| GC녹십자비맥스 | 36 | 6.2% |
| 구주알코덱스 | 21 | 3.6% |
| 삼진제약트레스탄 | 21 | 3.6% |
| Other values (10) | 63 | 10.8% |
[Figure: histogram of value lengths]
date
| Distinct | 38 |
|---|---|
| Distinct (%) | 6.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20.05285223 |
| Minimum | 19.01 |
|---|---|
| Maximum | 22.02 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.7 KiB |
Quantile statistics
| Minimum | 19.01 |
|---|---|
| 5-th percentile | 19.03 |
| Q1 | 19.1 |
| median | 20.06 |
| Q3 | 21.02 |
| 95-th percentile | 21.1 |
| Maximum | 22.02 |
| Range | 3.01 |
| Interquartile range (IQR) | 1.92 |
Descriptive statistics
| Standard deviation | 0.8482766424 |
|---|---|
| Coefficient of variation (CV) | 0.04230204425 |
| Kurtosis | -1.068898989 |
| Mean | 20.05285223 |
| Median Absolute Deviation (MAD) | 0.96 |
| Skewness | 0.2530244151 |
| Sum | 11670.76 |
| Variance | 0.7195732621 |
| Monotonicity | Not monotonic |
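Several figures in the two tables above are derived from the others, so they can be cross-checked with a few lines of arithmetic. The sketch below copies the reported values into illustrative variable names (not part of the report) and assumes the observation count of 582 from Dataset statistics:

```python
# Values copied from the quantile and descriptive statistics tables above.
n_rows = 582
minimum, maximum = 19.01, 22.02
q1, q3 = 19.1, 21.02
mean, std = 20.05285223, 0.8482766424

value_range = maximum - minimum   # reported Range: 3.01
iqr = q3 - q1                     # reported IQR: 1.92
cv = std / mean                   # reported CV: 0.04230204425
variance = std ** 2               # reported Variance: 0.7195732621
total = mean * n_rows             # reported Sum: 11670.76

print(round(value_range, 2), round(iqr, 2), round(cv, 6), round(variance, 6), round(total, 2))
```

Small differences in the last digits are expected because the inputs are themselves rounded.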
Histogram with fixed size bins (bins=38)
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 20.06 | 24 | 4.1% |
| 20.01 | 22 | 3.8% |
| 21.02 | 22 | 3.8% |
| 19.07 | 21 | 3.6% |
| 19.06 | 20 | 3.4% |
| 19.12 | 20 | 3.4% |
| 19.08 | 20 | 3.4% |
| 20.1 | 19 | 3.3% |
| 21.08 | 19 | 3.3% |
| 20.05 | 18 | 3.1% |
| Other values (28) | 377 | 64.8% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 19.01 | 14 | 2.4% |
| 19.02 | 11 | 1.9% |
| 19.03 | 15 | 2.6% |
| 19.04 | 15 | 2.6% |
| 19.05 | 14 | 2.4% |
| 19.06 | 20 | 3.4% |
| 19.07 | 21 | 3.6% |
| 19.08 | 20 | 3.4% |
| 19.09 | 15 | 2.6% |
| 19.1 | 18 | 3.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 22.02 | 9 | 1.5% |
| 22.01 | 6 | 1.0% |
| 21.12 | 6 | 1.0% |
| 21.11 | 6 | 1.0% |
| 21.1 | 12 | 2.1% |
| 21.09 | 12 | 2.1% |
| 21.08 | 19 | 3.3% |
| 21.07 | 12 | 2.1% |
| 21.06 | 14 | 2.4% |
| 21.05 | 14 | 2.4% |
item
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.7 KiB |
| 횟수 | |
|---|---|
| 금액 | |
| 노출량 |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 2.307560137 |
| Min length | 2 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 금액 |
|---|---|
| 2nd row | 횟수 |
| 3rd row | 노출량 |
| 4th row | 금액 |
| 5th row | 횟수 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 횟수 (count) | 202 | 34.7% |
| 금액 (amount) | 201 | 34.5% |
| 노출량 (exposure) | 179 | 30.8% |
[Figure: histogram of value lengths]
[Figure: pie chart of value frequencies]
TV
| Distinct | 238 |
|---|---|
| Distinct (%) | 40.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 92539.0378 |
| Minimum | 0 |
|---|---|
| Maximum | 3660470 |
| Zeros | 342 |
| Zeros (%) | 58.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 2553.75 |
| 95-th percentile | 679057.05 |
| Maximum | 3660470 |
| Range | 3660470 |
| Interquartile range (IQR) | 2553.75 |
Descriptive statistics
| Standard deviation | 306616.3064 |
|---|---|
| Coefficient of variation (CV) | 3.313372537 |
| Kurtosis | 46.48892914 |
| Mean | 92539.0378 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.664565475 |
| Sum | 53857720 |
| Variance | 9.401355937 × 10¹⁰ |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 342 | 58.8% |
| 4020 | 2 | 0.3% |
| 203 | 2 | 0.3% |
| 9390 | 2 | 0.3% |
| 397 | 1 | 0.2% |
| 4785 | 1 | 0.2% |
| 155027 | 1 | 0.2% |
| 155 | 1 | 0.2% |
| 2325 | 1 | 0.2% |
| 802312 | 1 | 0.2% |
| Other values (228) | 228 | 39.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 342 | 58.8% |
| 1 | 1 | 0.2% |
| 5 | 1 | 0.2% |
| 15 | 1 | 0.2% |
| 23 | 1 | 0.2% |
| 55 | 1 | 0.2% |
| 56 | 1 | 0.2% |
| 84 | 1 | 0.2% |
| 105 | 1 | 0.2% |
| 131 | 1 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 3660470 | 1 | 0.2% |
| 2851839 | 1 | 0.2% |
| 1622807 | 1 | 0.2% |
| 1564814 | 1 | 0.2% |
| 1556747 | 1 | 0.2% |
| 1414827 | 1 | 0.2% |
| 1372386 | 1 | 0.2% |
| 1150273 | 1 | 0.2% |
| 1109730 | 1 | 0.2% |
| 1041134 | 1 | 0.2% |
radio
| Distinct | 43 |
|---|---|
| Distinct (%) | 7.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1907.805842 |
| Minimum | 0 |
|---|---|
| Maximum | 124405 |
| Zeros | 540 |
| Zeros (%) | 92.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 471.2 |
| Maximum | 124405 |
| Range | 124405 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 13054.10718 |
|---|---|
| Coefficient of variation (CV) | 6.842471542 |
| Kurtosis | 59.15175943 |
| Mean | 1907.805842 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.676445452 |
| Sum | 1110343 |
| Variance | 170409714.3 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=43)
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 540 | 92.8% |
| 256 | 1 | 0.2% |
| 5100 | 1 | 0.2% |
| 79168 | 1 | 0.2% |
| 284 | 1 | 0.2% |
| 5680 | 1 | 0.2% |
| 77643 | 1 | 0.2% |
| 252 | 1 | 0.2% |
| 5040 | 1 | 0.2% |
| 77576 | 1 | 0.2% |
| Other values (33) | 33 | 5.7% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 540 | 92.8% |
| 68 | 1 | 0.2% |
| 94 | 1 | 0.2% |
| 97 | 1 | 0.2% |
| 124 | 1 | 0.2% |
| 252 | 1 | 0.2% |
| 255 | 1 | 0.2% |
| 256 | 1 | 0.2% |
| 284 | 1 | 0.2% |
| 399 | 1 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 124405 | 1 | 0.2% |
| 113545 | 1 | 0.2% |
| 111743 | 1 | 0.2% |
| 109798 | 1 | 0.2% |
| 108163 | 1 | 0.2% |
| 105083 | 1 | 0.2% |
| 79168 | 1 | 0.2% |
| 78516 | 1 | 0.2% |
| 77643 | 1 | 0.2% |
| 77576 | 1 | 0.2% |
newspaper
| Distinct | 129 |
|---|---|
| Distinct (%) | 22.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3897.43299 |
| Minimum | 0 |
|---|---|
| Maximum | 240745 |
| Zeros | 387 |
| Zeros (%) | 66.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 14 |
| 95-th percentile | 17490.15 |
| Maximum | 240745 |
| Range | 240745 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 18974.27627 |
|---|---|
| Coefficient of variation (CV) | 4.868403463 |
| Kurtosis | 89.78276971 |
| Mean | 3897.43299 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 8.790451178 |
| Sum | 2268306 |
| Variance | 360023159.8 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 387 | 66.5% |
| 6 | 7 | 1.2% |
| 7 | 6 | 1.0% |
| 1 | 6 | 1.0% |
| 3 | 6 | 1.0% |
| 2 | 5 | 0.9% |
| 1134 | 5 | 0.9% |
| 1323 | 4 | 0.7% |
| 5 | 4 | 0.7% |
| 945 | 3 | 0.5% |
| Other values (119) | 149 | 25.6% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 387 | 66.5% |
| 1 | 6 | 1.0% |
| 2 | 5 | 0.9% |
| 3 | 6 | 1.0% |
| 4 | 2 | 0.3% |
| 5 | 4 | 0.7% |
| 6 | 7 | 1.2% |
| 7 | 6 | 1.0% |
| 8 | 2 | 0.3% |
| 9 | 2 | 0.3% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 240745 | 1 | 0.2% |
| 205770 | 1 | 0.2% |
| 192670 | 1 | 0.2% |
| 166890 | 1 | 0.2% |
| 84915 | 1 | 0.2% |
| 74417 | 1 | 0.2% |
| 69574 | 1 | 0.2% |
| 59940 | 1 | 0.2% |
| 58213 | 1 | 0.2% |
| 53792 | 1 | 0.2% |
magazine
| Distinct | 27 |
|---|---|
| Distinct (%) | 4.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 424.7250859 |
| Minimum | 0 |
|---|---|
| Maximum | 17000 |
| Zeros | 432 |
| Zeros (%) | 74.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 4000 |
| Maximum | 17000 |
| Range | 17000 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1657.763595 |
|---|---|
| Coefficient of variation (CV) | 3.903145 |
| Kurtosis | 30.96986809 |
| Mean | 424.7250859 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.061401683 |
| Sum | 247190 |
| Variance | 2748180.138 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=27)
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 432 | 74.2% |
| 2 | 44 | 7.6% |
| 1 | 38 | 6.5% |
| 3 | 12 | 2.1% |
| 2000 | 12 | 2.1% |
| 4000 | 10 | 1.7% |
| 6000 | 4 | 0.7% |
| 5 | 4 | 0.7% |
| 4500 | 3 | 0.5% |
| 8000 | 2 | 0.3% |
| Other values (17) | 21 | 3.6% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 432 | 74.2% |
| 1 | 38 | 6.5% |
| 2 | 44 | 7.6% |
| 3 | 12 | 2.1% |
| 4 | 2 | 0.3% |
| 5 | 4 | 0.7% |
| 1800 | 2 | 0.3% |
| 2000 | 12 | 2.1% |
| 2500 | 1 | 0.2% |
| 3500 | 2 | 0.3% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 17000 | 1 | 0.2% |
| 11000 | 1 | 0.2% |
| 10500 | 1 | 0.2% |
| 10200 | 1 | 0.2% |
| 10000 | 1 | 0.2% |
| 9800 | 1 | 0.2% |
| 8500 | 1 | 0.2% |
| 8000 | 2 | 0.3% |
| 7000 | 1 | 0.2% |
| 6500 | 2 | 0.3% |
cable
| Distinct | 327 |
|---|---|
| Distinct (%) | 56.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 102401.3162 |
| Minimum | 0 |
|---|---|
| Maximum | 2114145 |
| Zeros | 252 |
| Zeros (%) | 43.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1120.5 |
| Q3 | 47722.5 |
| 95-th percentile | 676126.35 |
| Maximum | 2114145 |
| Range | 2114145 |
| Interquartile range (IQR) | 47722.5 |
Descriptive statistics
| Standard deviation | 246695.7151 |
|---|---|
| Coefficient of variation (CV) | 2.409106879 |
| Kurtosis | 13.5076444 |
| Mean | 102401.3162 |
| Median Absolute Deviation (MAD) | 1120.5 |
| Skewness | 3.282722085 |
| Sum | 59597566 |
| Variance | 6.085877586 × 10¹⁰ |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 252 | 43.3% |
| 15 | 3 | 0.5% |
| 1 | 2 | 0.3% |
| 3057 | 2 | 0.3% |
| 1919 | 1 | 0.2% |
| 4455 | 1 | 0.2% |
| 297 | 1 | 0.2% |
| 114258 | 1 | 0.2% |
| 8205 | 1 | 0.2% |
| 547 | 1 | 0.2% |
| Other values (317) | 317 | 54.5% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 252 | 43.3% |
| 1 | 2 | 0.3% |
| 8 | 1 | 0.2% |
| 15 | 3 | 0.5% |
| 26 | 1 | 0.2% |
| 29 | 1 | 0.2% |
| 37 | 1 | 0.2% |
| 86 | 1 | 0.2% |
| 100 | 1 | 0.2% |
| 120 | 1 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 2114145 | 1 | 0.2% |
| 1514175 | 1 | 0.2% |
| 1450474 | 1 | 0.2% |
| 1154806 | 1 | 0.2% |
| 1105133 | 1 | 0.2% |
| 1073588 | 1 | 0.2% |
| 1064678 | 1 | 0.2% |
| 953719 | 1 | 0.2% |
| 940881 | 1 | 0.2% |
| 919162 | 1 | 0.2% |
jong
| Distinct | 288 |
|---|---|
| Distinct (%) | 49.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 67332.03436 |
| Minimum | 0 |
|---|---|
| Maximum | 882647 |
| Zeros | 282 |
| Zeros (%) | 48.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 48 |
| Q3 | 5846.25 |
| 95-th percentile | 506917.25 |
| Maximum | 882647 |
| Range | 882647 |
| Interquartile range (IQR) | 5846.25 |
Descriptive statistics
| Standard deviation | 169069.5964 |
|---|---|
| Coefficient of variation (CV) | 2.510983041 |
| Kurtosis | 5.969114602 |
| Mean | 67332.03436 |
| Median Absolute Deviation (MAD) | 48 |
| Skewness | 2.618236492 |
| Sum | 39187244 |
| Variance | 2.858452843 × 10¹⁰ |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 282 | 48.5% |
| 204 | 2 | 0.3% |
| 286 | 2 | 0.3% |
| 2280 | 2 | 0.3% |
| 3540 | 2 | 0.3% |
| 5430 | 2 | 0.3% |
| 362 | 2 | 0.3% |
| 500 | 2 | 0.3% |
| 414 | 2 | 0.3% |
| 3060 | 2 | 0.3% |
| Other values (278) | 282 | 48.5% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 282 | 48.5% |
| 1 | 2 | 0.3% |
| 2 | 1 | 0.2% |
| 15 | 2 | 0.3% |
| 28 | 1 | 0.2% |
| 30 | 1 | 0.2% |
| 38 | 1 | 0.2% |
| 44 | 1 | 0.2% |
| 52 | 1 | 0.2% |
| 53 | 1 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 882647 | 1 | 0.2% |
| 851995 | 1 | 0.2% |
| 825515 | 1 | 0.2% |
| 740785 | 1 | 0.2% |
| 727150 | 1 | 0.2% |
| 724462 | 1 | 0.2% |
| 721839 | 1 | 0.2% |
| 704526 | 1 | 0.2% |
| 652259 | 1 | 0.2% |
| 651574 | 1 | 0.2% |
UCC
| Distinct | 196 |
|---|---|
| Distinct (%) | 33.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10145.73368 |
| Minimum | 0 |
|---|---|
| Maximum | 200322 |
| Zeros | 354 |
| Zeros (%) | 60.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 79.75 |
| 95-th percentile | 68930.15 |
| Maximum | 200322 |
| Range | 200322 |
| Interquartile range (IQR) | 79.75 |
Descriptive statistics
| Standard deviation | 30172.57874 |
|---|---|
| Coefficient of variation (CV) | 2.973917875 |
| Kurtosis | 15.4026131 |
| Mean | 10145.73368 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.815975053 |
| Sum | 5904817 |
| Variance | 910384507.6 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 354 | 60.8% |
| 33 | 7 | 1.2% |
| 48 | 3 | 0.5% |
| 49 | 3 | 0.5% |
| 45 | 3 | 0.5% |
| 3 | 3 | 0.5% |
| 32 | 2 | 0.3% |
| 42 | 2 | 0.3% |
| 4 | 2 | 0.3% |
| 88 | 2 | 0.3% |
| Other values (186) | 201 | 34.5% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 354 | 60.8% |
| 2 | 1 | 0.2% |
| 3 | 3 | 0.5% |
| 4 | 2 | 0.3% |
| 5 | 1 | 0.2% |
| 6 | 1 | 0.2% |
| 8 | 1 | 0.2% |
| 11 | 1 | 0.2% |
| 12 | 1 | 0.2% |
| 13 | 2 | 0.3% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 200322 | 1 | 0.2% |
| 186310 | 1 | 0.2% |
| 184981 | 1 | 0.2% |
| 182890 | 1 | 0.2% |
| 171888 | 1 | 0.2% |
| 167842 | 1 | 0.2% |
| 163406 | 1 | 0.2% |
| 150099 | 1 | 0.2% |
| 143653 | 1 | 0.2% |
| 137191 | 1 | 0.2% |
banner
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 120.0412371 |
| Minimum | 0 |
|---|---|
| Maximum | 61000 |
| Zeros | 569 |
| Zeros (%) | 97.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 61000 |
| Range | 61000 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2543.12093 |
|---|---|
| Coefficient of variation (CV) | 21.18539421 |
| Kurtosis | 568.1774826 |
| Mean | 120.0412371 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 23.71926611 |
| Sum | 69864 |
| Variance | 6467464.064 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=10)
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 569 | 97.8% |
| 2 | 4 | 0.7% |
| 1 | 2 | 0.3% |
| 28 | 1 | 0.2% |
| 18 | 1 | 0.2% |
| 50 | 1 | 0.2% |
| 24 | 1 | 0.2% |
| 2518 | 1 | 0.2% |
| 61000 | 1 | 0.2% |
| 6216 | 1 | 0.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 569 | 97.8% |
| 1 | 2 | 0.3% |
| 2 | 4 | 0.7% |
| 18 | 1 | 0.2% |
| 24 | 1 | 0.2% |
| 28 | 1 | 0.2% |
| 50 | 1 | 0.2% |
| 2518 | 1 | 0.2% |
| 6216 | 1 | 0.2% |
| 61000 | 1 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 61000 | 1 | 0.2% |
| 6216 | 1 | 0.2% |
| 2518 | 1 | 0.2% |
| 50 | 1 | 0.2% |
| 28 | 1 | 0.2% |
| 24 | 1 | 0.2% |
| 18 | 1 | 0.2% |
| 2 | 4 | 0.7% |
| 1 | 2 | 0.3% |
| 0 | 569 | 97.8% |
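Because this variable has only 10 distinct values, the frequency tables above list the whole column, so its summary statistics can be recomputed from them. The sketch below does that under two assumptions that the report does not state: the variance uses the n - 1 denominator, and the skewness is the bias-adjusted Fisher-Pearson coefficient. All names are illustrative.

```python
from math import sqrt

# Value -> count, copied from the frequency tables above (10 distinct values).
value_counts = {0: 569, 1: 2, 2: 4, 18: 1, 24: 1, 28: 1, 50: 1, 2518: 1, 6216: 1, 61000: 1}

# Rebuild the full column from the counts.
values = [value for value, count in value_counts.items() for _ in range(count)]
n = len(values)
total = sum(values)
mean = total / n

# Central moments with the population (1/n) denominator.
m2 = sum((v - mean) ** 2 for v in values) / n
m3 = sum((v - mean) ** 3 for v in values) / n

# Assumed conventions: sample variance (n - 1) and bias-adjusted skewness.
variance = m2 * n / (n - 1)
std = sqrt(variance)
skewness = sqrt(n * (n - 1)) / (n - 2) * m3 / m2 ** 1.5

print(n, total, round(mean, 4), round(std, 2), round(skewness, 3))
```

The single value of 61000 accounts for most of the variance and for the extreme skewness that triggers the Skewed alert.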
sum
| Distinct | 493 |
|---|---|
| Distinct (%) | 84.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 278768.1271 |
| Minimum | 1 |
|---|---|
| Maximum | 4636206 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 729.75 |
| median | 6576.5 |
| Q3 | 78860.25 |
| 95-th percentile | 1913842.75 |
| Maximum | 4636206 |
| Range | 4636205 |
| Interquartile range (IQR) | 78130.5 |
Descriptive statistics
| Standard deviation | 673950.856 |
|---|---|
| Coefficient of variation (CV) | 2.417603701 |
| Kurtosis | 9.457623496 |
| Mean | 278768.1271 |
| Median Absolute Deviation (MAD) | 6573.5 |
| Skewness | 2.991150483 |
| Sum | 162243050 |
| Variance | 4.542097563 × 10¹¹ |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 20 | 3.4% |
| 1 | 19 | 3.3% |
| 3 | 10 | 1.7% |
| 8 | 6 | 1.0% |
| 2000 | 5 | 0.9% |
| 6 | 4 | 0.7% |
| 9 | 3 | 0.5% |
| 4352 | 3 | 0.5% |
| 272 | 3 | 0.5% |
| 10 | 3 | 0.5% |
| Other values (483) | 506 | 86.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 19 | 3.3% |
| 2 | 20 | 3.4% |
| 3 | 10 | 1.7% |
| 4 | 3 | 0.5% |
| 5 | 1 | 0.2% |
| 6 | 4 | 0.7% |
| 7 | 2 | 0.3% |
| 8 | 6 | 1.0% |
| 9 | 3 | 0.5% |
| 10 | 3 | 0.5% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 4636206 | 1 | 0.2% |
| 3865603 | 1 | 0.2% |
| 3761609 | 1 | 0.2% |
| 3468368 | 1 | 0.2% |
| 3233193 | 1 | 0.2% |
| 3134494 | 1 | 0.2% |
| 2994362 | 1 | 0.2% |
| 2891034 | 1 | 0.2% |
| 2828142 | 1 | 0.2% |
| 2638408 | 1 | 0.2% |
Spearman's ρ
Spearman's rank correlation coefficient (ρ) measures monotonic correlation between two variables, and is therefore better than Pearson's r at catching nonlinear monotonic correlations. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
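The definition above can be sketched in plain Python: rank both variables, then apply the covariance-over-standard-deviations formula to the ranks. The helper names and the toy data are illustrative, and averaging the ranks of tied values is an assumed convention (it matters here because most columns contain many zeros).

```python
from math import sqrt

def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks

def pearson_r(x, y):
    """Covariance divided by the product of the standard deviations."""
    mean_x, mean_y = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)

def spearman_rho(x, y):
    """Pearson correlation of the rank variables."""
    return pearson_r(average_ranks(x), average_ranks(y))

# Toy data: y grows monotonically but not linearly with x.
x = [0, 1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
rho = spearman_rho(x, y)
r = pearson_r(x, y)
print(round(rho, 3), round(r, 3))
```

On this toy data ρ reaches 1 while Pearson's r stays below 1, which is the difference the paragraph describes.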
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
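A minimal sketch of this calculation in plain Python, including a check of the invariance under shifting and positive rescaling. The function name and the toy data are illustrative and not taken from the report.

```python
from math import sqrt

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    mean_x, mean_y = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    # The 1/n factors of covariance and standard deviations cancel out.
    return cov / sqrt(var_x * var_y)

x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]

r_original = pearson_r(x, y)
# Shift and rescale x (positive scale): r should not change.
r_rescaled = pearson_r([3 * v - 7 for v in x], y)
# An exact linear relation gives r = 1 regardless of the slope.
r_line = pearson_r(x, [0.5 * v + 10 for v in x])

print(round(r_original, 4), round(r_rescaled, 4), round(r_line, 4))
```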
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
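The pair-counting definition above can be sketched directly. This is the simple variant (often called tau-a) that divides by the total number of pairs; tie-adjusted variants exist and may differ on data with many tied values. The function name and toy data are illustrative.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs, as described above."""
    concordant = discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1   # both variables move in the same direction
        elif direction < 0:
            discordant += 1   # the variables move in opposite directions
        # direction == 0 means a tie in x or y; it counts toward neither group
    return (concordant - discordant) / len(pairs)

# Toy data: one swapped pair out of six possible pairs.
x = [1, 2, 3, 4]
y = [1, 3, 2, 4]
tau = kendall_tau_a(x, y)
print(round(tau, 4))
```

Here five pairs are concordant and one is discordant, so τ = (5 - 1) / 6.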
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that was proposed by Bergsma in 2013.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution.
A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly spot patterns in data completeness.
First rows
| | category | advertiser | product | date | item | TV | radio | newspaper | magazine | cable | jong | UCC | banner | sum |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 성인종합영양제 | 구주제약 | 구주알코덱스 | 19.10 | 금액 | 0 | 124405 | 0 | 0 | 0 | 0 | 0 | 0 | 124405 |
| 1 | 성인종합영양제 | 구주제약 | 구주알코덱스 | 19.10 | 횟수 | 0 | 498 | 0 | 0 | 0 | 0 | 0 | 0 | 498 |
| 2 | 성인종합영양제 | 구주제약 | 구주알코덱스 | 19.10 | 노출량 | 0 | 9960 | 0 | 0 | 0 | 0 | 0 | 0 | 9960 |
| 3 | 성인종합영양제 | 구주제약 | 구주알코덱스 | 19.11 | 금액 | 0 | 113545 | 0 | 0 | 0 | 0 | 0 | 0 | 113545 |
| 4 | 성인종합영양제 | 구주제약 | 구주알코덱스 | 19.11 | 횟수 | 0 | 437 | 0 | 0 | 0 | 0 | 0 | 0 | 437 |
| 5 | 성인종합영양제 | 구주제약 | 구주알코덱스 | 19.11 | 노출량 | 0 | 8740 | 0 | 0 | 0 | 0 | 0 | 0 | 8740 |
| 6 | 성인종합영양제 | 구주제약 | 구주알코덱스 | 19.12 | 금액 | 0 | 111743 | 0 | 0 | 0 | 0 | 0 | 0 | 111743 |
| 7 | 성인종합영양제 | 구주제약 | 구주알코덱스 | 19.12 | 횟수 | 0 | 473 | 0 | 0 | 0 | 0 | 0 | 0 | 473 |
| 8 | 성인종합영양제 | 구주제약 | 구주알코덱스 | 19.12 | 노출량 | 0 | 9460 | 0 | 0 | 0 | 0 | 0 | 0 | 9460 |
| 9 | 성인종합영양제 | 구주제약 | 구주알코덱스 | 20.01 | 금액 | 0 | 109798 | 0 | 0 | 0 | 0 | 0 | 0 | 109798 |
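In the rows shown above, sum equals the total of the eight media columns (TV through banner). Whether that holds for all 582 rows is an assumption, but it would explain why sum appears in so many high-correlation alerts. A small sketch of the check on the first three rows, with illustrative names:

```python
# Media columns in table order: TV, radio, newspaper, magazine, cable, jong, UCC, banner.
# Each entry: (row index, media values, reported sum), copied from the first rows above.
first_rows = [
    (0, [0, 124405, 0, 0, 0, 0, 0, 0], 124405),
    (1, [0, 498, 0, 0, 0, 0, 0, 0], 498),
    (2, [0, 9960, 0, 0, 0, 0, 0, 0], 9960),
]

# Collect the indices of rows whose media total differs from the reported sum.
mismatches = [index for index, media, reported in first_rows if sum(media) != reported]
print(mismatches)
```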
Last rows
| | category | advertiser | product | date | item | TV | radio | newspaper | magazine | cable | jong | UCC | banner | sum |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 572 | 성인종합영양제 | GC녹십자 | GC녹십자비맥스액티브정 | 20.01 | 금액 | 0 | 0 | 0 | 8000 | 0 | 0 | 0 | 0 | 8000 |
| 573 | 성인종합영양제 | GC녹십자 | GC녹십자비맥스액티브정 | 20.01 | 횟수 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 2 |
| 574 | 성인종합영양제 | GC녹십자 | GC녹십자비맥스액티브정 | 20.01 | 노출량 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 2 |
| 575 | 성인종합영양제 | GC녹십자 | GC녹십자비맥스액티브정 | 20.02 | 금액 | 0 | 0 | 0 | 10500 | 0 | 0 | 0 | 0 | 10500 |
| 576 | 성인종합영양제 | GC녹십자 | GC녹십자비맥스액티브정 | 20.02 | 횟수 | 0 | 0 | 0 | 3 | 0 | 0 | 0 | 0 | 3 |
| 577 | 성인종합영양제 | GC녹십자 | GC녹십자비맥스액티브정 | 20.02 | 노출량 | 0 | 0 | 0 | 3 | 0 | 0 | 0 | 0 | 3 |
| 578 | 성인종합영양제 | GC녹십자 | GC녹십자비맥스액티브정 | 20.03 | 금액 | 0 | 0 | 0 | 4000 | 0 | 0 | 0 | 0 | 4000 |
| 579 | 성인종합영양제 | GC녹십자 | GC녹십자비맥스액티브정 | 20.03 | 횟수 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 2 |
| 580 | 성인종합영양제 | GC녹십자 | GC녹십자비맥스액티브정 | 20.03 | 노출량 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 2 |
| 581 | 성인종합영양제 | GC녹십자 | GC녹십자비맥스정 | 21.02 | 횟수 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 |
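The date column appears to encode year and month as YY.MM: its 38 distinct values run from 19.01 to 22.02, which matches the 38 months from January 2019 to February 2022. The report profiled it as a number, so October shows up as 19.1 in the frequency tables and statistics such as the mean of 20.05 have no calendar meaning. The sketch below shows the effect and one way to parse the values as dates instead; the YY.MM reading, the 20YY century and the helper name are assumptions.

```python
from datetime import date

def parse_yy_mm(text):
    """Parse a 'YY.MM' string such as '19.10' into the first day of that month.

    Assumes two-digit years mean 20YY (hypothetical convention).
    """
    year_text, month_text = text.split(".")
    return date(2000 + int(year_text), int(month_text), 1)

# Read as a float, the trailing zero of October is lost.
as_float = float("19.10")
# Read as text and parsed, the month survives.
as_date = parse_yy_mm("19.10")

print(as_float)
print(as_date.isoformat())
```

Reading the column as text before profiling would let it be treated as a date or categorical variable rather than a numeric one.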